Characterization of Liaoning Cashmere Goat Transcriptome: Sequencing, De Novo Assembly, Functional Annotation and Comparative Analysis
Abstract
BACKGROUND The Liaoning cashmere goat is a breed well known for its cashmere wool. To expand the available transcriptome data and accelerate genetic improvement of this breed, we performed de novo transcriptome sequencing with next-generation sequencing technology to generate the first expressed sequence tag dataset for the Liaoning cashmere goat. RESULTS Transcriptome sequencing of the Liaoning cashmere goat on a Roche 454 platform yielded 804,601 high-quality reads. Clustering and assembly of these reads produced a non-redundant set of 117,854 unigenes, comprising 13,194 isotigs and 104,660 singletons. Based on similarity searches against known proteins, 17,356 unigenes were assigned to 6,700 GO categories, and the terms were summarized into three main GO categories and 59 sub-categories. In total, 3,548 and 46,778 unigenes had significant similarity to existing sequences in the KEGG and COG databases, respectively. Comparative analysis revealed that 42,254 unigenes aligned to 17,532 different sequences in NCBI non-redundant nucleotide databases. Of all unigenes, 97,236 (82.51%) mapped to the 30 goat chromosomes, and 35,551 (30.17%) matched 11,438 reported goat protein-coding genes. The remaining unmatched unigenes were further compared with cattle and human reference genes, and 67 putative new goat genes were discovered. Additionally, 2,781 potential simple sequence repeats were initially identified from all unigenes. CONCLUSION The transcriptome of the Liaoning cashmere goat was deeply sequenced, de novo assembled, and annotated, providing abundant data for better understanding of the Liaoning cashmere goat transcriptome. The potential simple sequence repeats provide a material basis for future genetic linkage and quantitative trait loci analyses.
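The counts reported in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the published figures, not part of the authors' pipeline; the variable names and the `percent` helper are illustrative assumptions.

```python
# Counts as reported in the abstract; names are illustrative, not the authors'.
ISOTIGS = 13_194
SINGLETONS = 104_660
UNIGENES = 117_854
MAPPED_TO_CHROMOSOMES = 97_236
MATCHED_TO_GOAT_GENES = 35_551


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Isotigs plus singletons should add up to the non-redundant unigene set.
unigene_total = ISOTIGS + SINGLETONS

# Shares of unigenes mapped to chromosomes and matched to known goat genes.
mapped_pct = percent(MAPPED_TO_CHROMOSOMES, UNIGENES)
matched_pct = percent(MATCHED_TO_GOAT_GENES, UNIGENES)

print(unigene_total == UNIGENES)
print(mapped_pct, matched_pct)
```

Under these assumptions the isotig and singleton counts sum to the stated unigene total, and both percentages agree with the values given in the abstract.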
Similar resources
Clustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering the vast amount of whole transcriptome sequencing data, it seems necessary to develop an algorithm to assemble transcriptome data. In this study we propose an algorithm for transcriptome assembly in the absence of a reference genome. First, the contiguous sequences are generated using de Bruijn graph with d...
De novo Assembly, Characterization and Functional Annotation of Southern Hake (Merluccius australis) Transcriptome
Southern hake (Merluccius australis) is an ecologically and economically important demersal fish in Chile and Argentina. Nevertheless, genetic resources for genetic or ecological studies on this species are scarce. Consequently, here we present transcriptome sequencing results (RNA-Seq) for spleen and liver tissues with the 454 FLX titanium platform. The de novo transcriptome assembly generated...
De Novo Assembly, Characterization and Functional Annotation of Pineapple Fruit Transcriptome through Massively Parallel Sequencing
BACKGROUND Pineapple (Ananas comosus var. comosus) is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to improve quality traits such as flavor, texture, appearance and fruit sweetness. Although the pineapple is an important fruit, there is insufficient tr...
Nature Genetics Paper Demonstrates How Inclusion of Bionano Genomics’ Next-Generation Mapping is Essential to Create the Most Complete and Accurate De Novo Mammalian Genome Assembly
SAN DIEGO, CA – March 9, 2017 – Bionano Genomics®, Inc., a company focused on genome structure analysis, today highlighted results from a study demonstrating how combining genomic sequencing and mapping technologies, including Bionano’s next-generation mapping (NGM), produced the most continuous de novo mammalian assembly to date, of the domestic goat (Capra hircus). The paper, “Single-molecule ...
De Novo Assembly of the Perennial Ryegrass Transcriptome Using an RNA-Seq Strategy
BACKGROUND Perennial ryegrass is a highly heterozygous outbreeding grass species used for turf and forage production. Heterozygosity can affect de Bruijn graph assembly, making de novo transcriptome assembly of species such as perennial ryegrass challenging. Creating a reference transcriptome from a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals ...